Safety-Aware Apprenticeship Learning
Abstract
Apprenticeship learning (AL) is a class of “learning from demonstrations” techniques where the reward function of a Markov Decision Process (MDP) is unknown to the learning agent and the agent has to derive a good policy by observing an expert’s demonstrations. In this paper, we study the problem of how to make AL algorithms inherently safe while still meeting their learning objectives. We consider a setting where the unknown reward function is assumed to be a linear combination of a set of state features, and the safety property is specified in Probabilistic Computation Tree Logic (PCTL). By embedding probabilistic model checking inside AL, we propose a novel counterexample-guided approach that can ensure both safety and performance of the learnt policy. We demonstrate the effectiveness of our approach on several challenging AL scenarios where safety is essential.
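The two ingredients named in the abstract, a reward that is linear in state features and a PCTL-style bound on the probability of reaching an unsafe state, can be illustrated on a toy example. The sketch below is not the paper's counterexample-guided algorithm. The 4-state chain, the feature vectors, the weights `w`, and the helper names (`reward`, `feature_expectations`, `reach_probability`, `satisfies_safety`) are all hypothetical and chosen only for illustration. It assumes a fixed policy has already been applied, so the MDP reduces to a Markov chain.

```python
import numpy as np

# Hypothetical Markov chain induced by some fixed policy (rows sum to 1).
# State 2 is an absorbing goal, state 3 is an absorbing unsafe state.
P = np.array([
    [0.0, 0.9, 0.0, 0.1],
    [0.0, 0.0, 0.8, 0.2],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
])

# One feature vector per state (illustrative): [goal indicator, unsafe indicator].
features = np.array([
    [0.0, 0.0],
    [0.0, 0.0],
    [1.0, 0.0],
    [0.0, 1.0],
])
w = np.array([1.0, -1.0])  # assumed weights; in AL these are unknown


def reward(w, features):
    """Linear reward R(s) = w . f(s), evaluated for every state."""
    return features @ w


def feature_expectations(P, features, start, gamma):
    """Discounted feature expectations mu = sum_t gamma^t E[f(s_t)]."""
    n = P.shape[0]
    # Discounted state occupancy d solves (I - gamma * P)^T d = e_start.
    d = np.linalg.solve((np.eye(n) - gamma * P).T, np.eye(n)[start])
    return d @ features


def reach_probability(P, unsafe, tol=1e-12, max_iter=10_000):
    """Probability of eventually reaching an unsafe state, per start state.

    Fixed-point iteration x <- P x with x pinned to 1 on unsafe states.
    """
    x = np.zeros(P.shape[0])
    x[unsafe] = 1.0
    for _ in range(max_iter):
        x_new = P @ x
        x_new[unsafe] = 1.0
        if np.max(np.abs(x_new - x)) < tol:
            return x_new
        x = x_new
    return x


def satisfies_safety(P, unsafe, start, p_max):
    """Check a PCTL-style property P<=p_max [ true U unsafe ] from `start`."""
    return reach_probability(P, unsafe)[start] <= p_max


if __name__ == "__main__":
    print("rewards:", reward(w, features))
    print("feature expectations:", feature_expectations(P, features, 0, 0.9))
    print("P(reach unsafe) from state 0:", reach_probability(P, [3])[0])
    print("safe at bound 0.3:", satisfies_safety(P, [3], 0, 0.3))
```

In a full pipeline a probabilistic model checker would normally perform the reachability check and return counterexamples. The fixed-point iteration here only stands in for that step on a small chain.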
Similar resources
Comparison of Students’ Perception of Preparedness for Interprofessional learning readiness in apprenticeship and apprenticeship on site in Schools of Nursing and Midwifery of Islamic Azad Universities in Isfahan, Iran in 2018
Background & Objective: Interprofessional education (IPE) is one of the new approaches in the education of students in health-related disciplines. This type of training can increase interprofessional collaborations, thereby improving patient care quality. This study aimed to compare the perception of IPE in students apprenticeship and apprenticeship on site in schools of nursing and midwifery o...
Early Start in Software Coaching
The demand for software coaching and coaches is increasing. As our programming courses are organized according to the Extreme Apprenticeship method, it is relatively safe and straightforward to allow students to participate as coaches in our CS1 course even as early as their second semester. Safety is ensured by the hierarchical structure of CS1 course personnel that provides enough peer and fa...
Situating Learning in the Workplace: Having Another Look at Apprenticeships
This article examines the acquisition of vocational skills through apprenticeship-type situated learning. Findings from a studies of skilled workers revealed that learning processes that were consonant with the apprenticeship model of learning were highly valued as a means of acquiring and maintaining vocational skills. Supported by current research and theorising, this article, describes some ...
Generalizing Apprenticeship Learning across Hypothesis Classes
This paper develops a generalized apprenticeship learning protocol for reinforcement-learning agents with access to a teacher who provides policy traces (transition and reward observations). We characterize sufficient conditions of the underlying models for efficient apprenticeship learning and link this criteria to two established learnability classes (KWIK and Mistake Bound). We then construct...
Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
We consider apprenticeship learning—learning from expert demonstrations—in the setting of large, complex domains. Past work in apprenticeship learning requires that the expert demonstrate complete trajectories through the domain. However, in many problems even an expert has difficulty controlling the system, which makes this approach infeasible. For example, consider the task of teaching a quad...
Journal title:
- CoRR
Volume abs/1710.07983
Publication date 2017